Statistical Methods for Functional Metagenomic Analysis Based on Next-generation Sequencing Data
نویسندگان
چکیده
Metagenomics is the study of a collective microbial genetic content recovered directly from natural (e.g., soil, ocean, and freshwater) or host-associated (e.g., human gut, skin, and oral) environmental communities that contain microorganisms, i.e., microbiomes. The rapid technological developments in next generation sequencing (NGS) technologies, enabling to sequence tens or hundreds of millions of short DNA fragments (or reads) in a single run, facilitates the studies of multiple microorganisms lived in environmental communities. Metagenomics, a relatively new but fast growing field, allows us to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Also, it assists us to identify significantly different metabolic potentials in different environments. Particularly, metagenomic analysis on the basis of functional features (e.g., pathways, subsystems, functional roles) enables to contribute the genomic contents of microbes to human health and leads us to understand how the microbes affect human health by analyzing a metagenomic data corresponding to two or multiple populations with different clinical phenotypes (e.g., diseased and healthy, or different treatments). Currently, metagenomic analysis has substantial impact not only on genetic and environmental areas, but also on clinical applications. In our study, we focus on the development of computational and statistical methods for functional metagnomic analysis of sequencing data that is obtained from various environmental microbial samples/communities.
منابع مشابه
Statistical Approach of Functional Profiling for a Microbial Community
BACKGROUND Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not efficient in capturing the structure of the microbial community in an environment. Nowa...
متن کاملGenome analysis An Informative Approach on Differential Abun- dance Analysis for Time-course Metagenomic Se- quencing Data
Motivation: The advent of high-throughput next generation sequencing technology has greatly promoted the field of metagenomics where previously unattainable information about microbial communities can be discovered. Detecting differentially abundant features (e.g., species or genes) plays a critical role in revealing the contributors (i.e., pathogens) to the biological or medical status of micr...
متن کاملMetagenomics: Tools and Insights for Analyzing Next-Generation Sequencing Data Derived from Biodiversity Studies
Advances in next-generation sequencing (NGS) have allowed significant breakthroughs in microbial ecology studies. This has led to the rapid expansion of research in the field and the establishment of "metagenomics", often defined as the analysis of DNA from microbial communities in environmental samples without prior need for culturing. Many metagenomics statistical/computational tools and data...
متن کاملAnalyzing Metagenomic Data: Inferring Microbial Community Function with Mg-rast
Application of massively parallel throughput DNA sequencing technologies to the generation of metagenomic datasets from environmental samples is presently transforming the field of microbiology. Whereas traditional (Sanger-based) DNA sequencing technology imparted a high economic cost on data generation, the development of “next-generation” technologies now make the large-scale generation of se...
متن کاملMetaSim—A Sequencing Simulator for Genomics and Metagenomics
BACKGROUND The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets. METHODOLOGY/PRIN...
متن کامل